Combining statistical alignment and phylogenetic footprinting to detect regulatory elements
نویسندگان
چکیده
MOTIVATION Traditional alignment-based phylogenetic footprinting approaches make predictions on the basis of a single assumed alignment. The predictions are therefore highly sensitive to alignment errors or regions of alignment uncertainty. Alternatively, statistical alignment methods provide a framework for performing phylogenetic analyses by examining a distribution of alignments. RESULTS We developed a novel algorithm for predicting functional elements by combining statistical alignment and phylogenetic footprinting (SAPF). SAPF simultaneously performs both alignment and annotation by combining phylogenetic footprinting techniques with an hidden Markov model (HMM) transducer-based multiple alignment model, and can analyze sequence data from multiple sequences. We assessed SAPF's predictive performance on two simulated datasets and three well-annotated cis-regulatory modules from newly sequenced Drosophila genomes. The results demonstrate that removing the traditional dependence on a single alignment can significantly augment the predictive performance, especially when there is uncertainty in the alignment of functional regions. AVAILABILITY SAPF is freely available to download online at http://www.stats.ox.ac.uk/~satija/SAPF/
منابع مشابه
Identification of Regulatory Elements Using Comparative Genomics and Phylogenetic Footprinting
With the complete compilation of several genomic sequences, understanding the regulation of gene activity has become one of the primary goals for the Molecular Biology community. This research has the objective of identifying regulatory elements of human genes using Phylogenetic Footprinting. All the biological data used comes from NCBI databases(HomoloGene, EntrezGene, EntrezNucleotide), diffe...
متن کاملCALL FOR PAPERS Comparative Genomics Identifying cis-regulatory elements by statistical analysis and phylogenetic footprinting and analyzing their coexistence and related gene ontology
Shi W, Zhou W, Xu D. Identifying cis-regulatory elements by statistical analysis and phylogenetic footprinting and analyzing their coexistence and related gene ontology. Physiol Genomics 31: 374–384, 2007. First published September 11, 2007; doi:10.1152/physiolgenomics.00085.2006.—Discovery of cis-regulatory elements in gene promoters is a highly challenging research issue in computational mole...
متن کاملCONREAL: conserved regulatory elements anchored alignment algorithm for identification of transcription factor binding sites by phylogenetic footprinting.
Prediction of transcription-factor target sites in promoters remains difficult due to the short length and degeneracy of the target sequences. Although the use of orthologous sequences and phylogenetic footprinting approaches may help in the recognition of conserved and potentially functional sequences, correct alignment of the short transcription-factor binding sites can be problematic for est...
متن کاملA taxonomy-traversing approach to discover cis-acting elements in prokaryotes
The increasing number of sequenced genomes opens promising avenues to apply comparative genomics in order to detect phylogenetically conserved cis-acting elements, and to study their divergence across taxonomy. This approach, called phylogenetic footprinting is based on the hypothesis that, due to selective pressure, regulatory elements tend to evolve at a slower rate than surrounding non-codin...
متن کاملWhole Genome Human/Mouse Phylogenetic Footprinting of Potential Transcription Regulatory Signals
UNLABELLED Phylogenetic footprinting is an efficient approach for revealing potential transcription factor binding sites in promoter sequences. The idea is based on an assumption that functional sites in promoters should evolve much slower then other regions that do not bear any conservative function. Therefore, potential transcription factor (TF) binding sites that are found in the evolutional...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 24 10 شماره
صفحات -
تاریخ انتشار 2008